Receptive Field Block Net for Accurate and Fast Object Detection
نویسندگان
چکیده
Current top-performing object detectors depend on deep CNN backbones, such as ResNet-101 and Inception, benefiting from their powerful feature representation but suffering from high computational cost. Conversely, some lightweight model based detectors fulfil real time processing, while their accuracies are often criticized. In this paper, we explore an alternative to build a fast and accurate detector by strengthening lightweight features using a crafting mechanism. Inspired by the structure of Receptive Fields (RFs) in human visual systems, we propose a novel RF Block (RFB) module, which takes the relationship between the size and eccentricity of RFs into account, to enhance the discriminability and robustness of features. We further assemble the RFB module to the top of SSD with a lightweight CNN model, constructing the RFB Net detector. To evaluate its effectiveness, experiments are conducted on two major benchmarks and the results show that RFB Net is able to reach the accuracy of advanced very deep backbone network based detectors while keeping the real-time speed. Code is available at https://github.com/ruinmessi/RFBNet.
منابع مشابه
Novelty detection in image recognition using IRF Neural Networks properties
Image Receptive Fields Neural Network (IRF-NN) is a variant of feedforward multi-layer perceptrons adapted to image recognition. It shows very fast training as well as robust and accurate results on supervised classification tasks. This paper presents another property of IRF-NN: responses of trained networks can be analysed to detect unknown images. Several discriminative and efficient novelty ...
متن کاملPii: S0306-4522(98)00620-4
Optokinetic nystagmus is a reflex to stabilize an object image on the retina by compensatory eye movements. In lower vertebrates, the nucleus of the basal optic root participates in generating this reflex. Visual responses of 135 neurons were extracellularly recorded from the nucleus in pigeons and their receptive field properties were analysed on-line with a workstation. These cells could be c...
متن کاملObject - based Postprocessing of Block Motion FieldsFor Video
It is likely that in many applications block-matching techniques for motion estimation will be further used. In this paper, a novel object-based approach for enhancement of motion elds generated by block matching is proposed. Herein, a block matching is rst applied in parallel with a fast spatial image segmentation. Then, a rule-based object postprocessing strategy is used where each object is ...
متن کاملEfficient Video Indexing for Fast-motion Video
Due to advances in recent multimedia technologies, various digital video contents become available from different multimedia sources. Efficient management, storage, coding, and indexing of video are required because video contains lots of visual information and requires a large amount of memory. This paper proposes an efficient video indexing method for video with rapid motion or fast illuminat...
متن کاملDo Convnets Learn Correspondence?
Convolutional neural nets (convnets) trained from massive labeled datasets [1] have substantially improved the state-of-the-art in image classification [2] and object detection [3]. However, visual understanding requires establishing correspondence on a finer level than object category. Given their large pooling regions and training from whole-image labels, it is not clear that convnets derive ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1711.07767 شماره
صفحات -
تاریخ انتشار 2017